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ABSTRACT 

We present 90, 140, and 268 GHz sub-arcminute resolution imaging of the Sunyaev-Zel'dovich effect 
(SZE) in the disturbed, intermediate redshift [z = 0.5458) galaxy cluster MACS J0717. 5+3745, a 
triple-merger system comprising four distinct, optically-detected subclusters. Our 90 GHz SZE data 
result in a sensitive, 34 /iJy beam -1 map of the SZE at 13" effective resolution using the MUSTANG 
bolometer array on the Green Bank Telescope (GBT). Our 140 and 268 GHz SZE imaging, with 
resolutions of 58" and 31" and sensitivities of 1.8 and 3.3 mJy beam -1 respectively, was obtained 
through observations from the Caltech Submillimeter Observatory (CSO) using Bolocam. We com- 
pare these maps to a 2-dimensional pressure map derived from Chandra X-ray observations. Our 
MUSTANG SZE data confirm previous indications from Chandra of a pressure enhancement due to 
shock-heated, > 20 keV gas immediately adjacent to extended radio emission seen in low-frequency 
radio maps of this cluster. MUSTANG also detects pressure substructure that is not well-constrained 
by the Chandra X-ray data in the remnant core of a merging subcluster. We find that the small-scale 
pressure enhancements in the MUSTANG data amount to ~ 2% of the total pressure measured in 
the 140 GHz Bolocam observations. The X-ray inferred pseudo-pressure template also fails on larger 
scales to accurately describe the Bolocam data, particularly at the location of the subcluster with a 
remnant core known to have a high line of sight optical velocity of ~ 3200 km s -1 . Our Bolocam data 
are adequately described when we add an additional component - not described by a thermal SZE 
spectrum - to the X-ray template coincident with this subcluster. Using flux densities extracted from 
our model fits, and marginalizing over the X-ray spectroscopic temperature constraints for the region, 
we fit a thermal + kinetic SZE spectrum to our Bolocam data and find the subcluster has a best-fit 
line of sight proper velocity v z = 3600+2160 km s -1 , in agreement with the optical velocity estimates 
for the subcluster. The probability v z < given our measurements is 2.1%. Repeating this analysis 
using flux densities measured directly from our maps results in a 3.4% probability v z < 0. We note 
that this tantalizing result for the kinetic SZE is on resolved, subcluster scales. 

Keywords: X-rays: galaxies: clusters: general - cosmology: observations - cosmic microwave back- 
ground 



1. INTRODUCTION 

Massive (> 10 15 M©) galaxy clusters are the largest 
gravitationally-bound objects in the Universe, and their 
formation is thought to be driven by mergers of smaller 
clusters in what are the most energetic events to take 
place since the Big Bang (Sarazin 2005). Our under- 
standing of the astrophysics of mergers and the intra- 
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cluster medium (ICM) has tradi tionally been advanced 
through X-ray observations (e .g. iMcNamara et all 120051 : 
iMarkevitch fc V ikhlinini I2007T) . which are sensitive to 
both the density and temperature of the gas. How- 
ever, sensitive, high angular resolution imaging of the 
Sunyaev-Zel'd ovich Effect (SZE) ha s recently become 
possible (e.g . iKitavama et all [2004] iNord et all 120091: 
Mason et all 120101 : iKorngut et all 120111 : iPlagge et all 
20121) . helping to yield a more complete view of the com- 
plex processes in the ICM, particularly at high redshift. 

The SZE is due to inverse Compton scattering 
of Cosmic Microwave Background (CMB) photons to 
higher energies on average by h ot electrons in the 
ICM (jSunvaev fc Zel'dovichl [1971 : for reviews of the 
SZE, including its the rmal and kinetic comp onents, see 
iBirkinshawl (j 19991) and lCarlstrom et all ((2002). The SZE 
complements X-ray and optical observations, offering 
some of its own unique advantages. First, unlike intrin- 
sic emission mechanisms (e.g. X-ray, optical, radio) from 
a cluster, the SZE does not suffer cosmological surface 
brightness dimming (cx (1 + z)~ A ). Second, the thermal 
SZE (tSZE) is proportional to the line of sight integrated 
thermal electron pressure, while X-ray observations are 
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sensitive to the ICM density-squared. The different line 
of sight dependences of X-ray and SZE data allow one 
to infer information about the line of sight properties 
of the ICM. Third, there is the potential to constrain 
the component of ICM proper velocity along the line of 
sight using a Doppler shift of the CMB known as the 
kinetic SZE (kSZE). The combination of the SZE obser- 
vations with those from radio, optical, X-ray, and other 
wavebands can provide a more complete understanding 
of cluster mergers and other astrophysical processes. 

A particularly strikin g example of a cluster merger 
is MA CS J0717.5+3745 (lEbeling et aLl[200ll : lEdge et all 
12001 . At a redshift z = 0.5458, MACS J0717.5+3745 
is the ho ttest member of th e MAssive Cluster Survey 
(MAC S; lEbeling et afll200ll) at z > 0.5; lEbeling et all 
(|2007f l report an average X-ray spectroscopic tempera- 
ture fc B T x = 11.6 ± 0.5 keV, determined within an over- 
density of 1000 times the critical density of the universe 
and by excluding the central 70 kpc of the cluster, and 
a velocity dispersion a ~ 1660 km s -1 within a 1 Mpc 
aperture. Optical imaging and spectroscopy have shown 
that the cluster is located at the end o f a 4 hm Mpc 
long filamentary overdensity of galaxies (Ebcling et al. 
2004), consistent with the expectation tha t clusters 
form at int ersections in the cosmic web. iMa et ail 
(|2008l I2009D have also shown that the cluster itself com- 
prises four distinct groups of galaxies and appears to 
be a rare tri ple-merger i n pro g ress. Low-frequency ra- 
dio images (lEdge et all 120031 : Ivan Weeren et all 120091 : 
iBonafede et all 12009) show the presence of diffuse radio 
emission as well, indicative of a radio relic or halo. 

In this paper we compare multi-wavelength observa- 
tions of the SZE with X-ray, radio, and lensing ob- 
servations. Our SZE imaging data include 90 GHz 
data collected wit h the MUSTANG bolometer array 
picker et al.ll2008D on the Green Bank Telescope (GBT), 
along with 140 an d 268 GHz data collected with Bolocam 
(|Haig et al.ll2004l) from the Caltech Submillimeter Obser- 
vatory (CSO). The MUSTANG data have a resolution 
of 13" - due to the 8.5" instrument resolution and 10" 
smoothing of the maps - and a sensitivity of ~ 34 /zJy 
beam -1 within a central radius of ~ 40". The Bolocam 
data have resolutions of 58" and 31" at 140 and 268 GHz 
- on opposite sides of the null in the thermal SZE spec- 
trum - and have sensitivities of 1.8 and 3.3 mJy beam -1 , 
respectively. The Bolocam data probe scales as large as 
14', while the more sensitive MUSTANG data do not re- 
liably probe scales > 1'. Additionally, we compare these 
observations to a CARMA/SZA 31 GHz observation of 
this cluster. The short baselines of CARMA/SZA probe 
scales up to 12' with a 2' synthesized beam. The longer 
baselines of the CARMA/SZA, with a ~ 10" synthesized 
beam, lack the sensitivity to detect SZE on these scales, 
but provide constraints on the locations and spectral 
indices of the potentially contaminating compact radio 
source population. We therefore use the CARMA/SZA 
data to place constraints on the large angular scale clus- 
ter properties (the "bulk" SZE flux) and to help constrain 
contamination by radio sources. 

The organization of the paper is as follows. In $2] 
we provide an overview of results from previous optical, 
low frequency radio, and X-ray observations of MACS 
J0717.5+3745. In we describe the MUSTANG, Bolo- 



Surface Mass Distribution (k) 
0.5 1 




37 43 30 



07 n 17 



Right Ascension 



Figure 1. Surface mass distribution in units of the critical sur- 
fa ce density (converge nce, k) for MACS J0717. 5+3745 determined 
by Zitrin et al. (2009) through strong lensing measurements (color 
image). Black circles correspond to the four main peaks in the 
cluster light distribution as identified by Ma et al. (2009). The 
magent a contours (3, 6, 12, 24- a) reproduce the 610 MHz GMRT 
data in van Weeren ct al. 2009. The blue contours reprodu ce the 
(5,7) xlO 10 M Q arcsec ^contours in ILimousin et all 120121). We 
note t hat recent redshift information reported in ILimousin et ail 
(2012) reduces the overall mass estimate, but that the features 
otherwise show broad agreement. 

cam, CARMA/SZA, and Chandra X-ray observations 
and data reduction. 2] gives the details of modelling of 
the SZE observable properties and the X-ray data prod- 
ucts used in this analysis. In <j5]we use the Bolocam 140 
& 268 GHz data, along with our X-ray temperature con- 
straints, to infer the peculiar velocity of one subcluster 
component with a spectrum that is not well described 
by the thermal SZE alone, and we compare these esti- 
mates to those for the most massive subcluster in this 
system. We present our conclusions from this multi- 
wavelength SZE study in SjfjJ Throughout this paper, 
we adopt a flat, A-dominated cosmology with Q M = 0.3, 
£1a = 0.7, and Hq = 70 km s -1 Mpc -1 consistent with 
recent Wilkinson Microwave Aniso tropy Probe (WMAP) 
results (|Komatsu et al.H2009l[20TU ). 

2. PREVIOUS ANALYSES OF MACS J0717.5+3745 

MACS J0717. 5+3745, a co mplex merging system dis - 
covered in the MACS survey (jEbehng et/alll200lL[20l)7h . 
is among the best-studied massive clusters at redshift 
z > 0.5, with observations spanning a broad range of 
the electromagnetic spectrum. In support of our multi- 
wavelength SZE study, we compare our MUSTANG and 
Bolocam observations to X-ray, optical, and low fre- 
quency radio data, and to the results of the many studies 
that have relied upon these data. 

From X-ray and optical analyses, IMa et all 
(2009) identify four distinct components in MACS 
J0717. 5+3745. These subcluste rs are shown in Fig ure [T] 
on the strong lensing data from lZitrin et a l. (2009!), who 
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note that this cluster has the largest known Einstein 
radius, 6 e ~ 55". iZitrin et al.l 120091 cite this large 
Einstein radius and the shallow surface mass distribu- 
tion as further evidence tha t the cluster is disturbed, 
while ILimousin et al.l ([2012D characterize this as "one 
of the most disturbed clusters presently known" in 
their recent stron g lensing analysis. We note that the 
lensing analysis of Zitrin et al.l l2009i assumed a redshift 
z ~ 2.5 for the primary lense d system. The recent 
redshift information reported in ILimousin et al.l (|2012f ) 
entails a shift in the normalization of the surface mass 
distribution k, reducing the overall mass estimate for 
MACS J0717. 5+3745. The analysis presented here only 
relies on the fact that lensing has located four mass 
peaks, and that they show good agreement with the 
r egions optically identified bv lMa et al.1 (|2009f ). 

iMa et ail 120091 provide the following interpretation of 
the four dominant mass components in this merging clus- 
ter. Subclustcr A is the least massive and is likely falling 
back into the main cluster from the NW, after having 
passed through once already. Subclusters B and D are 
likely remnant cores that survived an initial encounter 
in a merger along an axis inclined much more toward the 
line of sight. Subcluster C, which is the most massive 
component in MACS J0717. 5+3745, exhibits good 
X-ray/optical alignment and is most likel y the disturbed 
core of the main cluster. IMa et aLll2009l also report the 
line of sight (optical) spectroscopically-determined ve- 
locities for subclusters A, B, C, & D as (v A , v B , v c , u D ) = 

(+278jS»,+3238±«g,-733l«g,+83llgg) km s'\ 
We note the remarkably high line of sight velocity for 
subcluster B (> 0.01 c). 

The Giant Metrewave Radio Telescope (GMRT) ob- 
served MACS J0717.5+3745 at 610 MHz for a total of 4 
hours (jvan Weeren et al.l[2009l ). These GMRT observa- 
tions reveal a powerful radio halo with a spectral index 
a = —1.25, as well as a 700 kpc wide sub s truct ure identi- 
fied a s a radio relic by | va n Weeren et aT1(|2009D . Accord- 
ing to Ivan Weeren et al . 2009, the progenitor of the radio 
relic is likely a merger-driven shock wave within the clus- 
ter which has accelerated electrons via the Diffuse Shock 
Acceleration (DSA) mechanism. The radio substructure 
brackets the high temperature regions of the ICM (see 
Figure 131 and is oriented perpendicular to the merger 
axes and large-scale filament, which supports the relic 
s cenario. 

iBonafede et al.l ([2009) performed high and low reso- 
lution observations of MACS J0717.5+3745 in full po- 
larization mode with the Very Large Array (VLA), at 
frequencies spanning 1.365-4.885 GHz. They measure 
polarizations up to 20% in the radio substructure and 
find no sharp discontinuity of the E-vectors between the 
large scale halo and the putative relic, which is expected 
for a true relic (|Clarke fc En sslin 2006). Bonafed e et al.l 
2009 find that the observed lack of Faraday rotation does 
not agree with the expectation for a relic produced by a 
merger shock near the center of a cluster. Lastly, there is 
no steepening of the spectral index across the short axis 
of the substructure, as woul d be expected of a radio relic 
following a merger. Thus, Bonafe de et al.l argue that 
the substructure is not a radio relic, but rather a bright, 
polarized filament connected with the radio halo. 

The competing interpretations of the low frequency ra- 



dio data both support the hypothesis that merger activ- 
ity has produced a relativistic, nonthermal component to 
the ICM of MACS J0717.5+3745, not seen in the Chan- 
dra X-ray observations implying it has a relatively low 
density typical of radio relics/halos. 

3. OBSERVATIONS & DATA REDUCTION 
3.1. MUSTANG Observations 

MUSTANG is a 64 element bolometer array operat- 
ing at 90 GHz on the 100-m GBT with a resolution of 
8."5. For more inf ormation about MUSTANG, refer to 
IDicker et al.l (|2008h Fl 

Between October 2010 and February 2012 we observed 
MACS J0717.5+3745 for a total of 15 hours on source 
over the course of 11 sessions. Observations were carried 
out using an on-the-fly scan strat egy similar to that de - 
scribed in lMason et~atl (|2010f ) and lKorngut et al.l (|201lD . 
For this, we move the telescope in a "lissajous daisy" pat- 
tern with a 3' radius and slowly nutate the center of the 
daisy pattern to increase the coverage. Seven different 
pointing centers were used to further expand the cover- 
age of the central part of the cluster. 

MUSTANG data were reduc ed and calibra t ed ac cord- 
ing to procedu r es de scribed in iMason et al.l (|2010fl and 
iKorngut et al.l ([201 If ). The overall measurement errors 
and uncertainties in both the temperatures of Uranus 
and Mars, and the shape of the GBT's beam, limit our 
flux calibration to 10% uncertainty in the absolute flux 
scale. The data presented here are fully common-mode 
subtracted; that is, we remove the instantaneous signal 
common to the detector array. This has the benefit of 
removing atmospheric noise at the expense of attenuat- 
ing signals from features with angular scales >1'™ This 
limits the spatial dynamic range of the MUSTANG data 
presented here. Two largely independent implementa- 
tions of the map-making pipeline exist, and have different 
approaches for common-mode subtraction, residual noise 
filtering, and data flagging. The features in Figure [2] are 
recovered by both pipelines with consistent brightness 
levels. 

The MUSTANG map in the top panel of Figure [J 
has three main features, two due to the cluster and 
one due to a foreground elliptical galaxy. The two ex- 
tended SZE decrements have peaks of —169 ± 37 and 
— 188 ± 30 fiJy beam -1 , and are detected at 4.6 and 
6.2-cr significance at their peaks. The foreground galaxy, 
which is not co-spatial with the SZE in the map, is re- 
solved by MUSTANG and has an integrated flux den- 
sity of +2.8 ± 0.2 mJy (13.7-cr at the peak) and an ex- 
tended shape of 14.4" x 16.1". The source is also de- 
tected at other wavelengths, including Chandra, GMRT, 
VLA (FIRST/NVSS), CARMA/SZA, and optical obser- 
vations to name a few. 

The radio sources detected in our maps at >3-er, such 
as the aforementioned foreground galaxy in the MACS 
J0717. 5+3745 field, can be modelled and subtracted from 
our data. Provided the source has a significant detection, 
an initial estimate of the position, flux, and, if appropri- 
ate, shape of the source can be estimated from a first- 

1 http : / /www . gb . nr ao . edu/must ang/ 

2 The transfer function reaches half the power of that in the 10- 
30" range, where it is flat, at ~ 60". See lMason et al.l pOToT ) for 
more details. 
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Figure 2. MUSTANG maps of MACS J0717.5+3745 with con- 
tours starting at ±3-<r, spaced at 1-cr intervals for the decrement 
(red) and at 3-cr intervals for the positive signal (blue). The effec- 
tive resolution of the map, which is 13" due to the instrument beam 
(8."5 FWHM) and image smoothing with a 10" FWHM Gaus- 
sian, is depicted in the lower left corner of each map. The noise 
level in the inner ~ 40" radial region is 34 fi.]y beam -1 , and is 
53 fijy beam -1 within the inner 3' radial region. Due to our map- 
making process, features on angular scales > 1' are attenuated. 
The largest extended SZE signal in this high-pass filtered view is 
located near subclusters C & D, while a secondary SZE feature is 
also detected near B (see text for details). Upper: MUSTANG 
map of MACS J0717.5+3745 before modelling and removal of the 
foreground source. Lower: MUSTANG map with foreground ra- 
dio source modelled and removed from the time-ordered data. Lo- 
cation of the removed source is marked with a magenta, dashed 
circle. Contours for the decrement are the same as above, but 
are now spaced at 1-cr intervals for positive emission in the radio 
source-subtracted map (starting at +3-<r). 



pass map of the MUSTANG data. This model is then 
subtracted from the time ordered data before making a 
new, point-source subtracted map (see lower panel of Fig- 
ure [2j . If necessary, the source model convolved by the 
beam can be updated to better remove any residual flux 
after the first-pass point source removal. This method 
has the advantage over simply subtracting the source 
from the map in that the true shape and size of the source 
can be recovered while accounting for non-linearities of 
the map-making routine (e.g., the influence of the source 
on the common mode subtraction and weight estimates). 
One drawback to this method is that it subtly alters the 
noise estimate in the S/N maps shown in Figure[5J How- 
ever, as we show in £14.2.11 this has a negligible impact 
on flux estimates from the MUSTANG data. 

3.2. Bolocam Observations 

Bolocam is a 144 element bolometer array capable of 
operation at either 140 or 268 GHz. Operating from 
the 10-m CSO, it provides resolutions of 58" and 31" at 
140 and 268 GHz, respectively, over an 8' instantaneous 
field of view. A complete description of th e Bolocam 
instrument can be found in iHaig et all (|2004f ) . 

We used Bolocam to observe MACS J0717. 5+3745 for 
a total of 12.5 hours at 140 GHz (6 hours in February 
2010 and 6.5 hours in October 2010) and for a total of 
8 hours at 268 GHz (3 hours in September 2011 and 
5 hours in November 2011). Due to the large instan- 
taneous field of view, these data have uniform coverage 
over the roughly 4' x 4' region in which we are inter- 
ested, with noise levels of 1.8 and 3.3 mJy beam -1 for 
the 140 and 268 GHz data, respectively. These data 
were reduced acco rding to the procedures described in 
ISavers et all (l27)TTI) using the updated calibration model 
of Sa vers et al.l ( 2012I ). We briefly summarize the reduc- 
tion below. 

We used frequent observations of the nearby quasars 
to obtain pointing corrections accurate to 5". We de- 
termined our flux calibration, with 5% and 10% un- 
certainties at 140 and 268 GHz, using observations of 
Uranus and Neptune. We subtracted atmospheric noise 
from the data via a common mode template and a time- 
stream high-pass filter with a characteristic frequency 
of 250 mHz. The largest scales recovered after filtering 
(at either observational frequency) are 14'. This noise 
subtraction also removes cluster signal, and we deter- 
mined the effective transfer function of our data process- 
ing by reverse-mapping a simulated cluster profile into 
our time-stream data and running it thro ugh the entire 
pipeline (for details see lSavers et al.ll2~011l) . We note that 
the transfer functions were determined independently for 
the 140 and 268 GHz data, and are slightly different from 
each other. 

This data reduction results in a high-pass filtered im- 
age of the astronomical signal. Therefore, prior to com- 
paring any model of the SZE signal to our data we must 
first convolve the model with both our PSF and the 
data-processing transfer function. Model fitting is other- 
wise straightforward, and the map noise is approximately 
white and is well described by a diagonal noise covariance 
matrix. Alternatively, we can also deconvolve the effects 
of the data-processing transfer function to obtain an un- 
biased image of the astronomical signal (aside from the 
effects of our PSF). Although the deconvolution ampli- 
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fi.es the large scale noise in the image, it allows us to 
directly compute total flux densities from within an ar- 
bitrary aperture. In this manuscript we exclusively use 
our processed data maps for all SZE model fitting to our 
Bolocam data, and we use our deconvolved data maps 
to obtain aperture-integrated fluxes which we compare 
to the model fit values. We note that in both cases, in 
order to fully account for any non-idealities in our noise, 
we determine all of our uncertainty estimates via 1000 
statistically-independent noise realizations. 

3.3. CARMA/SZA Observations 

Th e Sunyaev-Zel'dovich Array (SZA; iMuchovei et al.l 
12007ft . is a subarray comprising eight 3.5-m antennae 
in the Combined Array for Millimeter-wave Astronomy 
(CARMA). CARMA, including the SZA, is located at 
an altitude of 2200 m in the Inyo Mountains of Califor- 
nia. The SZA is capable of observing in three modes: 
separately, paired with the larger 6.1 and 10-m antennae 
of CARMA, and as part of the full CARMA 23-element 
heterogeneous array (see iPlagge et al.l 120121 for recent 
observations with the full array at 90 GHz). 

Our observations use the 31 GHz SZA as an in- 
dependent system in a compact configuration sensi- 
tive to 1-6' scales with a 10.'5 FWHM primary beam, 
which determines the field of view (for more details see 
IMuchovei et al.ll2007f ). The spatial filtering of the inter- 
ferometer allows small scale positive point source emis- 
sion to be separated from the large-scale, negative SZE 
signal at th ese frequencies (fo r an example in a similar 
context see lReese et al.l [20021 ) . The compact configura- 
tion provides two outrigger antennae (for a total of 13 
of the 28 baselines) that probe ~ 10" scales, though the 
rms noise level (~ 0.2 mJy) is not low enough to measure 
the SZE, which is a factor of ~ 6x lower at 31 GHz than 
at 90 GHz. Our SZA 31 GHz o bservations are r educe d 
using the pipeline described in IMuchovei et all (|2007| ). 
and were calibrated using the IRudvl (11987) model for 
Mars. The synthesized beam of the inner six elements 
of the SZA, which is in a compact configuration to opti- 
mize cluster imaging on large scales, was 118 x 131" for 
these observations. We use the SZA observations to con- 
strain the bulk, arcminute-scale SZE decrement and to 
leverage the spectral indices of potential contamination 
by compact radio sources. 

3.4. Chandra X-ray Observations 

The Chandra X-ray Observatory made two ACIS-I ob- 
servations of MACS J0717. 5+3745 for a total exposure 
time of 81 ksec (Obs IDs 1655 and 4200). We have re- 
duced the data using CIAO version 4.3 and calibration 
database (CALDB) version 4.4.5. Starting with the level 
1 events file, standard corrections are applied along with 
light curv e filtering and oth er standard processing (for 
details sec Reese et al. 2010). A wavelet-based detection 
algorithm is used to find point sources, which is then 
used as the basis of our point source mask. We discuss 
the data products we generate from our calibrated Chan- 
dra X-ray data in i j4.ll 

4. THERMAL ANALYSIS OF THE ICM 

We do not expect the assumption of spherical symme- 
try to be an accurate description of this merging clus- 
ter on subarcminute scales. This motivates the use of 



X-ray data to produce a 2-D template for the thermal 
SZE signal in our observations. X-ray data can be used 
to infer electron pressure, P c = n c fc B T c , the thermody- 
namic property to which the tSZE is linearly sensitive. 
X-ray spectroscopy provides a measure of temperature 
fc B T c , while X-ray imaging (i.e. surface brightness 5 X ) is 
proportional to the line of sight integral of the electron 
density squared, n 2 , times the X-ray cooling function A oc 

(oc Tj-I 2 ). Resolved temperature and pseudo-pressure 
maps, so-called because X-ray data alone do not con- 
strain the line of sight depth of the cluster, are common 
in detailed X-ray studies of ICM ther modynamics (see 
e.g. IMa et al J 120091: IRussell et afllMcj) . 
The intensity of the thermal SZE is 
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(1) 



where the primary CMB intensity normalization is 
Io = 2(k B T CMB ) 3 (hc)- 2 = 2.7033' x 10 8 Jy sr^ 1 (= 
22.87 Jy/arcmin 2 ), and T nMR is the tem perature of the 
CMB. Following ICarlstrom et al.l ((2002), the function 
g(v,T B ) describes the frequency dependence o f the tSZE. 
We include the relativist ic corrections of (Ito h et al.1 
U998tlItoh fc Nozawall200l . The Compton y parameter 
in Equation Q] is defined as 



.fe,T„ di = 



P P dt 



(2) 



where er T is the Thomson cross section, m c c 2 is the rest 
energy of the electron, the integration is along the line-of- 
sight £, and we have used the ideal gas law (P c = n e k B T e ). 

4.1. X-ray Data Products 

To build the thermal SZE templa te, we first bin the re- 
duced Chandra data using contbin (|Sandersll2006l ) , which 
uses the X-ray surface brightness to select regions of the 
image large enough to obtain a desired S/N level. For 
the temperature maps, we chose regions of the X-ray 
surface brightness image with S/N > 45. Each bin pro- 
vides independent measurements of surface brightness 
Sx, temperature k B T a , and metallicity Z. For the spec- 
tral analysis to determine fc B T c and Z, data extracted 
from each region are fit jointly using spectra and response 
files produced from each individual observation. Regions 
containing point sources are excluded from the extrac- 
tion of the spectra and computation of the response files. 
XSPEC is used to perform a joint spectral fit to both 
datasets over the 0.7-7.0 keV energy range, linking tem- 
perature and metallicity between the data sets but al- 
lowing the normalizations to vary. The masked regions 
of the data product maps are then filled in via a simple, 
bilinear interpolation scheme. 

Using the Chandra-derived k B T c and Z maps, we com- 
pute the cooling function, A ee (T e , Z), as a function of 
map bin, and use it to compute a line of sight integrated 
pseudo-pressure map, where 



P c di cx k B T c \ 



/4^(l + z) 3 ff x l 
A CC (T ,Z) 



(3) 



Here S x (in cts cm -2 s _1 sr _1 ) is the X-ray surface 
brightness, and A„(T C ,Z) (in cts cm 3 s -1 ) contains the 
additional factor of (1 + z)^ 1 required by cosmological 
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Figure 3. A comparison between MUSTANG SZE detections and the thermodynamic properties of MACS J0717. 5+3745 inferred from 
Chandra X-ray data. MUSTANG SZE detections from lower p anel of Figure [2] are overla id (black contours, except for lower right panel). 
GMRT radio contours are overlaid in green (reproduced from van Wccrcn ct al. (2009)). Upper Left: X-ray surface brightness map 
(shown on a scale <x <JSx-)- Upper Right: X-ray temperature map. The MUSTANG peak detections of the SZE are located both where 
the hottest gas is found in the Chandra data (C & D) and at the location of subcluster B. Lower Left: Pseudo Compton-j; map. This 
rescaling of the pseudo pressure map aids in the interpretation of the SZE data, and serves as a 2-D template for modelling the thermal 
SZE ( P~2l) . Lower Right: Map of the entropy parameter K = k B T B /n a 2 / 3 in MACS J0717.5+3745. 



dimming, due to redshifting of the photon energy. The 
factor t is an effective electron depth of the cluster along 
the line of sight, taken to be a single value over the map. 
Surprisingly, this is consistent with the average slopes 
found when using simple / 3-models to describe the den- 
sity and pressure p rofiles (|Sarazinlll988t iLaRoque et al.l 
l2006HPlagge et al.ll201(t ): a justification for this assump- 
tion may be found in the Appendix. We note that this 
does not imply pressure is constant along the line of sight, 



S x 



but rather that the average ratio of Compton-y to 
is approximately constant azimuthally. 

We refer to this X-ray template for the thermal SZE, 
which is simply a rescaling of the X-ray pseudo-pressure 
map by \fl, as a "pseudo Compton-y map." In construct- 
ing this, we have assumed Equation [2] can be approxi- 
mated as y — <T T /(m c c 2 ) J P c d£ w a T /(m a c 2 )P c £. The 
key ingredient to building this template is the use of an 
SZE measurement on large angular scales to infer £. By 
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Figure 4. These maps show the ±l-er statistical error bars for 
the temperature map presented in Figure [3] Typical errors bars 
are +3/-2 keV for most regions, except for the hottest regions where 
subclusters C & D are interacting and the gas temperature is sig- 
nificantly out of Chandra's energy band. 



using the integrated SZE flux, Yt SZ E = Jydfl, from the 
Bolocam 140 GHz and SZA 31 GHz observations to nor- 
malize the sum over the pixels in the pseudo Compton- 
y map, we determine the median effective depth for the 
cluster to be I ss 1.4 Mpc. The resulting maps are shown 
in Figure [31 Adopting this median I = 1.4 Mpc would 
impact the model prediction for the thermal SZE in the 
MUSTANG observations on the 20% level. However, we 
effectively marginalize over the value of £ during the fits 
to the Bolocam data by allowing the normalization of the 
model to vary. 



The comparison of X-ray pseudo-pressure to high- 
resolution observations of the SZE is an important first 
step toward moving beyond simple, spherical models. 
This is especially important for merging clusters, such 
as MACS J0717. 5+3745, that exhibit complicated ther- 
modynamics and ICM distrib utions. A similar ap proach 
has recently been applied by iPlagge et al.l (|2012f) in the 
analysis of high-resolution SZE observations. There are, 
however, several systematics that could affect the com- 
parison of X-ray pseudo-pressure to SZE data. First, 
we have implicitly assumed that the temperatures in our 
contbin map are constant in each bin, both in the plane 
of the sky and along the line of sight, while errors on 
the temperatures ca n be a s large as ~ 25% (see Figure 
[4] and discussion in i|4.2.1[) . Binning limits the spatial 
resolution in the resulting 2-D SZE template, while the 
temperature distribution along the line of sight could af- 
fect the normalization in each bin. Second, density sub- 
structure effects such as clumping ((nf) > (n c ) 2 ) and the 
presence of a cool core would bias our density estimates 
from the X-ray toward higher values, while the high lu- 
minosities of the cooler, denser clumps would bias the 
X-ray spectroscopic temperatures towards lower values. 

Using the same X-ray data products used for the 
pseudo Compton-y map, we also produce maps of the en- 
tropy distribution in this cluster. We adopt the entropy 
parameter K = fc B T c n+ 2 / 3 [ keVcm 2 ] commonly us ed in 
cluster astrophysics (see e.g. iCavagnolo et aL| [2009). We 
approximate the (pseudo-)density 



/4^(l + z)3ff x 
£A ee (T e ,Z) : 



(4) 



where £ is that inferred from the SZE data. The maps in 
Figure [3] of the projected 2-D thermodynamic distribu- 
tion imply that the highest pressure, hottest, and most 
entropic region is associated with the merger between C 
& D, while the remnant core of subcluster B exhibits 
a local entropy minimum. Here the high local pressure 
substructure of B is due to its high density, suggesting 
this component, which has a high line of sight velocity, is 
relatively intact and has probably approached or passed 
through the cluster with a high impact parameter. 

4.2. Thermal SZE Analysis 

Figure [5] shows the significance contours from each 
of our SZE observations, overlaid on the X-ray pseudo 
Compton-w map smoothed to the resolution of each in- 
strumentjj Qualitatively, the maps at 90 and 140 GHz 
agree with the X-ray-derived pseudo Compton-y maps. 
There are two small-scale pressure peaks that are co- 
spatial with the MUSTANG detections of pressure sub- 
structure, and the 140 GHz data and the 2-D model ( "the 
tSZE template") broadly agree. However, there are two 
main discrepancies: the MUSTANG data show signifi- 
cant levels of substructure - particularly near B - not 

3 Figure \E\ is included only to facilitate the qualitative cross- 
comparison of our multi-wavelength SZE observations. These 
pseudo Compton-j/ maps are merely smoothed to each instrument's 
resolution, and the transfer function is not accounted for in the 
maps in Figure \E\ in order to more accurately show the underlying 
pseudo Compt on -?/ pre diction. Figures [6] & [7] and the analysis in 
Sections 14.2.11 & 14.2.21 do include the transfer function (i.e. the 
model there is processed in the same way as the data). 
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Figure 5. Contours showing the SZE decrement (increment) for microwave observations below (above) ~218 GHz. Each observation is 
overlaid on the X-ray pseudo Compton-y map smoot hed to the resol ution of the instrument (FWHM depicted in the lower left corner of 
each panel). The subcluster components identified by Ma ct al. 2009 are labelled as in Figure [T] 

Left: MUSTANG observation with point sources subtracted, overlaid on a 10" FWHM smoothed map. Decrement contours are overlaid 
in blue at the (3, 4, 5, 6)-(j levels. Note that the MUSTANG data overlaid are filtered with the instrument's transfer function, while the 
background image is simply smoothed to 10". 

Middle: Decrement contours from the processed Bolocam 140 GHz data are overlaid in blue at (5, 10, 15, 20, 25)-<r on the 58" FWHM 
smoothed pseudo Compton-jy map. We note that the data are significantly shifted toward subcluster B, which would be boosted by the 
kSZE by ~ 34% assuming the optical velocity along the line of sight, v B m 3200 km/s (Ma ct al. 2009), is equal to the gas proper velocity. 
Right: Increment contours from the processed Bolocam 268 GHz observation, overlaid in red at (3, 4, 5)-cr on the 31" FWHM smoothed 
pseudo Compton-y map. The SZE increment favors the massive component C, which would be boosted by the kSZE up to 10% assuming 
the optically determined velocity along the line of sight, vq ~ —700 km/s (Ma ct al. 2009). The total SZE from subcluster B would be 
suppressed up to ~ 68% by the kSZE of this high velocity component. 



predicted by the template, and the Bolocam 140 GHz 
data are shifted ~ 20" toward subcluster B. This shift is 
significant and cannot be explained by pointing offsets; 
including the 5" intrinsic uncertainty in the CSO point- 
ing, the Bolocam centroid is determined to 8" precision. 

Looking at the 268 GHz Bolocam data (Figure [SJ 
right), we see a more profound disagreement than that 
seen in the observations of the SZE decrement. The SZE 
increment from subcluster C clearly dominates the data, 
while no flux from B is apparent at S/N > 1. While the 
lack of agreement between the MUSTANG 90 GHz de- 
tection of subcluster B and the non-detection at 268 GHz 
could be explained by filtering effects and the lower sen- 
sitivity in the 268 GHz Bolocam data, we note that the 
discrepancy between the Bolocam decrement and incre- 
ment data cannot be explained by the effects of signal fil- 
tering due to the atmospheric noise subtraction. The fo- 
cal plane geometry, scan pattern, and atmospheric noise 
subtraction are identical in the Bolocam observations at 
both frequencies, and the effects of signal filtering are 
relatively mild and approximately the same at both fre- 
quencies. While the noise level is much higher in the 
268 GHz observation than that in the 140 GHz observa- 
tion, it is also clear from the SZE observations alone that 
flux is missing from component B at 268 GHz, while B 
is significantly brighter at 140 GHz than the X-ray data 
indicate it should be. 

The properties of the SZE features observed by MUS- 
TANG in MACS J0717.5+3745 are summarized in Ta- 
ble [TJ In this table we provide the coordinates and in- 
tegrated fluxes of the SZE features in the MUSTANG 
observation. For these, we use the primary MUSTANG 
map (Figure [2] upper panel), the map with the fore- 
ground emission removed ("NW/SE sre sub"; Figure [2 
lower panel), and a map with all radio sources removed 



based on upper limits to their fluxes extrapolated from 
lower frequency radio observations ("NW/SE src+relic 
sub" ; see discussion below) . For the two main SZE fea- 
tures, called the southeast (SE) and northwest (NW) 
features, we provide estimates of the flux density from 
the raw map, from the map with the foreground radio 
source removed, and from the map with the foreground 
detected source and sources detected at lower frequen- 
cies and extrapolated conservatively to 90 GHz removed. 
We note that our flux density estimates are consistent 
for all three maps, indicating that the results are robust 
to radio source contamination. 

Table [2] reports the integrated Com pton Y t ^ 7lE = 
f ydV l computed from model fits of the lArnaud et al.l 
(2010) pressure profile to the 31 GHz CARMA/SZA and 
140 GHz Bolocam data. We find that, taken together, 
the flux in the NW and SE features, as sampled by MUS- 
TANG'S measurements (Tabled]), account for ~ 2% of 
YtszE on large scales. 

We describe below how we quantitatively compare the 
tSZE template to our SZE observations. For both MUS- 
TANG and Bolocam the pseudo Compton-y maps were 
first scaled to each instrument's o bserving fr e quenc y us- 
ing the relativistic corrections of lltoh et al.1 (jl998| ) and 
assuming the temperatures shown in Figure |3l Next, 
the maps were smoothed with the corresponding PSFs 
of each instrument, and then filtered to account for the 
signal attenuation in each instrument's data processing 
pipeline. 

4.2.1. Modelling the tSZE in the MUSTANG Data 

We fit the radio source in the MUSTANG observa- 
tion and subtract the cluster template as follows. First, 
we fit the emission from the foreground elliptical galaxy 
(detected at S/N « 13.7) with an elliptical Gaussian. 
Accounting for our transfer function, the fitted ellipti- 
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Table 1 

Properties of SZE features observed by MUSTANG in MACS J0717.5+3745. 



Region 


Fcak Location (JZUUUJ 
R.A. Dec. 


Peak /go 
(^Jy/bm) 


Peak y 




Integrated Y 

(io- 12 ) 


Tcmperature a 
(keV) 


NW b 


07:17:30.68 +37:45:38.1 


-169 + 37 


7.7 x 10" 


5 


0.99 + 0.2 


12.8+11 


NW src sub 




-164+37 


7.5 x 10" 


5 


0.98 + 0.2 




NW src+relic sub 




-163 + 36 


7.5 x 10" 


5 


0.98 + 0.2 




SE C 


07:17:33.95 +37:44:49.4 


-188 + 30 


9.0 x 10" 


5 


2.37+0.3 


34.0±ft 


SE src sub 




-207+ 33 


9.9 x 10" 


5 


2.40+0.2 




SE src+relic sub 




-206 + 34 


9.8 x 10" 


5 


2.40+0.2 





a Temperatures reported here are from Chandra X-ray spectroscopic fi ts to the regions selected by MUS- 
TANG at > 3-ct over many beams. These regions are smaller than the Ma ct al. (2009) regions, and thus 
differ from the values reported in Table[3] Due to the limited energy range available for X-ray spectroscopy 
(< 10 keV), the temperature of the hottest gas is poorly constrained. 
b Remnant subcluster core associated with Ma et al. 2009 component B. 
c CTianaira-detected hot-spot associated with [Ma et al.1 12009] components C & D. 
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Table 2 

Large Scale Thermal SZE properties of MACS J0717.5H 
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Figure 6. Left: MUSTANG map with foreground radio source subtracted (as in the lower panel of Figure [2}- The contours from the 
processed pseudo Compton-y model overlaid in red at the -102, -136, -170 /ijy beam - 1 level. Middle: Pseudo Compton-j/ tSZE template 
processed through the MUSTANG pipeline (corresponding to the contours from the left panel). Contours from MUSTANG map with 
foreground radio source subtracted are overlaid at (3, 4, 5, 6)-cr, as in the lower panel of Figure [2] Right: Residuals in the MUSTANG 
data after subtraction of the pseudo Compton-j/ map and radio point source model from the time ordered data (i.e. the scans used in 
map-making). Contours are (3, 4)-ct. The point source is cleanly modelled and subtracted, while the X-ray pseudo Compton-y map leaves 
significant (> 4-<r) residual flux on small scales, potentially due to filtering effects in the model (e.g. discontinuities at the temperature 
region boundaries), the limited X-ray temperature resolution in our fc B T c map, or the complicated line of sight structure of the cluster. See 
text for details. 

pact sources extrapolated from 610 MHz GMRT and 
1.4 & 5 GHz VLA data, using a constant power law 
extrapolation (a — —1.25 for the extended feature, 
and a = —0.7 for the compact radio sources). In- 
cluded with this extrapolation was the detected fore- 
ground source. Where as sociatio ns can be made wit h 
FIRST (I White et alJll99l ■ NVSS (ICondon et al.lH998jh 
OVRO/BIMA (|Coble et al1l200l . or our CARMA/SZA 
data, we use the measured flux densities to help constrain 
the spectral index of the source. We also tested sub- 
traction of the extended radio feature using a = —1.4. 
In both cases, the extrapolated sources had a negligi- 
ble impact. Our spectral extrapolation should place a 
conservative upper limit on the radio source flux densi- 
cal Gaussian is broadened to 14.4" x 16.1" FWHM. The ties at 90 GHz ' Synchrotron sources, in general show 
cluster template is held fixed, and both the foreground steeper sp ectra at higher frequencies due to radiative 
elliptical galaxy and tSZE template were subtracted to losses (e.g-I Canlh et al|1991] | Cotton et al. || 2009| , and re- 
produce the maps of the residuals shown in Figure © e l en ^ es herein). For the inferred ~ 3 M G magnetic field 
We also subtract the extended radio feature and com- of the tended halo/relic component (|Bonafede et al. | 



Instrument 


Frequency 
(GHz) 


Centroid (J2000) 
R.A. Dec. 


y 5 oo a 

(io- 10 ) 


SZA 


31 


07:17:30.4 +37:45:25.9 


2 04+ - 20 


Bolocam 


140 


07:17:31.9 +37:45:20.5 


2 44+ - 50 



a Using rsoo from Mauehan ct al. (2008), where rsoo = 1-36 Mpc 
is the radius within which the average density is 500 times the 
critical density of the universe at that redshift. This corresponds 
to 0500 = 3.5'. The va lues of Y500 were determined by fitting 
the [X rnaud ct al. (2010) p ressure p r ofile t o the data, as in e.g. 
IReese et al.l (120121 ) and ISavers etal] <20TlD . 
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2009), the radiative lifetime of the electrons giving rise 
to emission at 90 GHz is ~ 28 Myr. If this population is 
older, then the spectrum i s likely to be st e eper th an we 
have assumed. Recently, iMarriage et al.1 (|2011bl ) mea- 
sured a steepening on average of radio source spectra 
above 20 GHz. We find that the contribution from the 
detected and undetected, extrapolated radio sources has 
a negligible impact on SZE flux measured at 90 GHz in 
these MUSTANG maps (see Table [J). 

After subtraction, we find a 4-er residual flux decre- 
ment associated with the interactions of subclusters C 
and D, located between the strong lensing peaks. This 
southeast peak (SE) is likely due to the shock-heated 
gas produced in their merger, as indicated by the X-ray 
data and supported by the presence of extended radio 
emission to the north. Restricting our X-ray spectral 
extraction to the region selected by MUSTANG at 3-a, 
contains ~ 3000 photon counts, we re-fit the X-ray data 
and find a temperature T c = 34+y 1 4 keV (see Tabled]), 
up from T D = 24.1+g'g keV for the contour binned map. 
If we further restrict our X-ray spectral extraction to the 
region selected by the residuals in the right-hand panel of 
Figure [6j we find that we cannot place meaningful con- 
straints on its temperature, despite having rj 1200 X-ray 
photon counts. This is presumably due to the tempera- 
ture in that region being above Chandra's energy range. 
Since the tSZE is linear with temperature, such a high 
temperature would increase the expected signal over that 
in the pseudo Compton-y map by ~ 40% from this re- 
gion, and would account entirely for the tSZE residual 
we found. 

We also find residual flux south of subclustcr B. This 
residual is of similar significance as the residual near sub- 
clusters C and D. Extracting rj 1000 photon counts from 
the Chandra data for the region selected by this residual, 
we find a temperature T = 14. 2+32 keV. This is within 
the errorbars for the entire region selected by MUSTANG 
(see Table [T|), while the region's X-ray surface bright- 
ness is much lower than that within region B. We there- 
fore interpret the residual south of region B to be largely 
an artifact of the subtraction of the pseudo Compton-y 
template. While the X-ray information aids the inter- 
pretation of our high-resolution SZE data, the residual 
substructure could be due to a number of systematics, 
discussed below. 

Foremost, the binning in the temperature maps pro- 
duces large discontinuities from region to region. When 
high pass filtered, this can introduce a ringing effect in 
the model image (see middle panel of Figure EJ . This 
seems to be the case for the region just to the south of 
subcluster B. A small, 3-cr feature appears to be boosted 
by the positive ringing of an otherwise negative template. 
Improvements to the modelling and generation of pseudo 
Compton-y templates will be explored in a future work 
including relaxed clusters with deeper X-ray data. We 
note that the X-ray temperature constraints in our k B T B 
map come at the cost of reduced resolution in the result- 
ing tSZE template. Many of the temperature bins are 
~ 1 arcmin 2 in area, while the resolution of the MUS- 
TANG map is 13". Therefore, some of the larger tem- 
perature excursions due to shock heating on small scales 
could be missing from our X-ray k B T B map. 

Errors in the X-ray derived temperatures as high as 



~ 25% shown in Figure 0] give rise to two additional 
sources of uncertainty in the tSZE template. The dom- 
inant one is simply that pressure scales linearly with 
temperature, while the additional error due to using 
the wrong temperature when computing the relativis- 
tic correction to the thermal SZE at 90 GHz anticor- 
relates with temperature and is on the « —3% level for 
AT B /T B ~ +0.25. Additional sources of possible system- 
atic error that could lead to discrepancies between the 
X-ray prediction and SZE measurements are clumping 
and temperature substructure (discussed in §4.ip and the 
breakdown of the assumption that the effective electron 
depth £ is constant (see Equation \S§ . 

4.2.2. Modelling the tSZE in the Bolocam Data 

For Bolocam we fit the filtered pseudo Compton-?/ map 
to our SZE data using a least-squares method and al- 
lowing only the normalization of the map to float. In 
performing the least squares fit we weight the map pix- 
els under the assu mption that the ma p noise covariance 
is diagonal, as in I Savers et al.l (|2011l ). However, since 
the diagonal noise covariance assumption is not strictly 
correct, we estimate our derived parameter uncertainties 
and fit quality via simulation using 1000 statistically- 
indepe ndent noise realizat ions, as described in more de- 
tail in ISayers et al.l (|2011| ). We note that by performing 
the fits in this manner, our parameter uncertainties do 
not rely on our assumption that the noise covariance is 
diagonal. 

Due to limitations in the extent of the pseudo 
Compton-y maps, which require accurate temperature 
determination from X-ray data, the fit was restricted to 
the inner 4' x 4' region of our Bolocam data. We find best- 
fit normalizations of 1.043 ± 0.092 and 0.839 ± 0.503 at 
140 and 268 GHz, indicating that the pseudo Compton-y 
maps are consistent with the total integrated SZE sig- 
nal in the Bolocam data at both observing frequencies. 
This is expected for the 140 GHz data since the pseudo 
Compton-y map was normalized so it would have the 
same F tsZE as that found in fits to the Bolocam 140 GHz 
data. However, the fit quality is actually quite poor at 
140 GHz, which we test by computing the value of x 2 to 
determine the probability to exceed (PTE), which is the 
probability that we would obtain a fit with a larger value 
of x 2 due to a random noise fluctuation^ For the fit to 
the 140 GHz data, we find x 2 /DOF = 199/143, with a 
PTE = 0.1%, indicating that the pseudo Compton-y map 
does not adequately describe our 140 GHz data (note 
that the fit quality is acceptable for the 268 GHz data, 
with x 2 /DOF = 624/575 and PTE = 22%). Based on 
the opposite signs of the residuals in units of Compton- 
y (see Figure [7]), we identify a spectral dependence that 
suggests the presence of a kinetic contribution to the SZE 

4 Note that we quote values for the PTE based on the fraction 
of noise realizations with larger \ 2 values to identical fits, rather 
than quoting value s based on the standard x 2 distribution. In 
ISavers et all (20TlL we found that the distribution of \ 2 values 
from our noise realizations closely matched the theoretical distri- 
bution, and we again find that to be the case for the 140 GHz data. 
However, for the 268 GHz data presented here there is a noticeable 
difference, with our noise realizations producing a distribution of 
X 2 values slightly higher than the theoretical prediction. This dis- 
crepancy is likely due to the larger amount of atmospheric noise in 
the 268 GHz data. 
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Figure 7. Bolocam map, processed tSZE template (model), and residuals. Upper panels are for the 140 GHz data, and lower panels are 
the corresponding plots for the 268 GHz data. 

Upper Left: Bolocam 140 GHz map, smoothed with a 60" FWHM Gaussian. Contours are overlaid at 3-cr intervals. Upper Middle: 
Pseudo Compton-j/ (tSZE) model at 140 GHz, processed through the same pipeline as the data, so that the filtering effects are the same 
for the model and data. The normalization was allowed to vary to find the best-fit value for the 140 GHz dataset. Contours are overlaid 
at 3-cr intervals. Upper Right: Bolocam 140 GHz residuals after the best-fit pseudo Compton-j/ model is removed. Contours are overlaid 
at 1-tr intervals. Note the color scale was changed to enhance residuals. 

Lower Left: Bolocam 268 GHz map smoothed with a 30" FWHM Gaussian. Contours are overlaid at l-cr intervals. Lower Middle: 
Same as upper middle, but for the 268 GHz data. Contours are overlaid at l-cr intervals. Lower Right: Same as upper right, but for 
the 268 GHz data. Contours are overlaid at l-cr intervals. Note the color scale is unchanged for the 268 GHz plots due to the higher noise 
level. 

signal, which we show in £j5]to improve the fit quality for 
both data sets. 
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5. INFERRED PECULIAR VELOCITIES 

The residuals of the tSZE template fit, shown in Fig- 
ure [3 indicate the reason for the poor fit quality of our 
thermal SZE template to the Bolocam 140 & 26 8 GHz 
data. With the best-fit normalization given in §4.2.2( 
the pseudo Compton-y map clearly under-predicts the 
140 GHz signal near component B and over-predicts the 
signal near components C and D. Although the 268 GHz 
data are noisier, the opposite is true; for those data, the 
pseudo Compton-y map over-predicts the 268 GHz sig- 
nal near component B and under-predicts the signal near 
components C and D. Motivated by these results, along 
with the known, large velocity of the subcluster B, we 
consider the kinetic SZE as a possible explanation of 
the opposite discrepancies between our 140 and 268 GHz 
data and the pseudo Compton-y map. The intensity of 



A/ k szE — —IoT e 



x 4 e x 



c (e x - If 



[l + <SkszE(T„, (5) 



where v z is the line of sight proper velocity (positive for 
a subcluster receding from the observer), r e = J n c a T d£ 
is the electron depth, x = hv/(k B T CMB ) is the dimension- 
less frequency, and 5 kSZE (T e ,x,v z ) is a sm all relativistic 
correc tion which we compute according to Nozaw a~et al.l 
(12006ft . It can be seen from Equation[2]that under the as- 
sumption of constant temperature along the line of sight, 
r e oc y/T B . For fixed Compton-y, the kSZE will be sup- 
pressed at higher temperature. 

Since we lack precise knowledge of the shape of the pos- 
sible kSZE signal sourced by component B, we chose to 
describe it using a Gaussian profile with a 90" FWHM0 

5 We varied the FWHM of the profile between 30 and 120" and 
found a broad maximum in the fit quality at both 140 and 268 GHz, 
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Figure 8. Bolocam map, processed models (tSZE + kSZE Gaussian), and residuals. Upper panels are for the 140 GHz data, and lower 
panels are the corresponding plots for the 268 GHz data. Black, dashed circles in middle and right panels show location and size of added 
90" FWHM Gaussian model. 

Upper Left: Bolocam 140 GHz map, smoothed with a 60" FWHM Gaussian. Contours are overlaid at 3-cr intervals. Upper Middle: 
Model superposition of the best fit kSZE + pseudo Compton-j/ template, processed through the same pipeline as the dataset it was used 
to fit. The normalizations of both models were allowed to vary in order to find the best fit for the dataset. Contours are overlaid at 3-cr 
intervals. Upper Right: Bolocam 140 GHz residuals with full model (kSZE + pseudo Compton-y) removed. Contours are overlaid at 1-cr 
intervals, and the colormap scale was changed to enhance residuals. 

Lower Left: Bolocam 268 GHz map smoothed with a 30" FWHM Gaussian. Contours are overlaid at 1-cr intervals. Lower Middle: 
Pseudo Compton-y (tSZE) model at 268 GHz, processed through the same pipeline as the data, so that the filtering effects are the same 
for the model and data. The normalizations of both models were freely varied. Contours are overlaid at 1-cr intervals. Lower Right: 
Bolocam 268 GHz residuals with full model (kSZE + pseudo Compton-y) removed. Contours are overlaid at 1-cr intervals. Note the color 
scale is unchanged for the 268 GHz plots. 

is 1.78 ±0.68. We again find that including the Gaussian 
clump improves the fit quality (x 2 /DOF = 593/574 and 
PTE = 50%). The combined models and residuals are 
shown in Figure [5] 

Using the results of our model fits to compute flux den- 
sity, we attempt to fit for a tSZE+kSZE spectrum that 
can adequately describe subcluster B, the highest veloc- 
ity member of MACS J0717.5+3745. For comparison, 
we also fit the spectrum of subcluster C, since it is the 
most massive component, it has a lower known optical 



We then re-fit our 140 GHz data with a model consist- 
ing of the pseudo Compton-?/ map and the Gaussian 
source centered on component B, allowing the normal- 
ization of each component to vary. The inclusion of 
this Gaussian clump significantly improves the fit quality 
(X 2 / D F = 153/142 and PTE = 24%), indicating both 
that this Gaussian clump is justified statistically and that 
the co-added model is adequate to describe our data. 
Jointly fit with the Gaussian clump, the normalization 
for the pseudo Compton-y map fit to the 140 GHz data is 
0.768 ± 0.084, indicating the electron depth I ~ 0.8 Mpc 
(see £|4.1[) . We then fit the same Gaussian model to our 
268 GHz data, allowing the normalizations of it and of 
the pseudo Compton-y map to float. For this fit, the nor- 
malization for the tSZE template fit to the 268 GHz data 

centered near 90" . We therefore left the FWHM fixed at that value 
for the fit to the Bolocam data at both 140 and 268 GHz. 



velocity (« —700 km s ), and it is detected at 5-<r in 
the 268 GHz data. We also verify our mo del- dependent 
results using non-parametric, model-independent esti- 
mates of the flux densities from the deconvolved images 
for each of these regions. For all combinations of methods 
and regions, we use the integrated flux densities obtained 
from within 1' diameter apertures centered on subclus- 
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Table 3 

X-ray temperature and inferred peculiar velocities 



Subclustcr 


(keV) 


Av 3 - 


Sl40 GHz 


5*268 GHz 




Name 


(km/s) 


(mjy) 


(mjy) 


(km/s) 



B (model) b 


i3.7t?;S 


+3238^242 


-19.5 ± 2.0 


1.9 ± 5.8 


+3600^ 


B (nonpar) 






-17.0 ±2.1 


-0.5 ± 7.5 


+46401^ 


C (model) b 


24.i^; 1 5 


7 oo+486 
'°°_478 


-14.7 ± 1.6 


18.5 ± 7.5 


o 79n +2960 
•J' zu -2480 


C (nonpar) 






-13.0 ± 1.7 


17.7 ± 8.5 


-4120±iS|g 



a Optical velocity from lMa efaH J2009T) . 

b Derived using fluxes computed from combined fit using a superposition of the 
tSZE pseudo Compton-y template and the Gaussian clump component described 
in the text. 

° Derived using non-parametric flux measurements from aperture photometry. 



ter B or C. The values are listed in Table [3] We perform 
a grid search over the possible velocities, temperatures, 
and values for Y tsZE in each region. We include the X- 
ray k B T c constraints for each region when we compute 
X 2 for each combined, relativistically-corrected thermal 
+ kinetic SZE spectrum that can describe our measure- 
ments. 

Our joint constraints on velocity and Y tsZE for these 
regions, using both the model dependent and non- 
parametric fluxes, are shown in Figure |H1 Also shown 
in Figure [9] are the 1 and 2-a constraints on each param- 
eter when marginalizing over the other (the A\ 2 < 1 and 
Ax 2 < 4 regions for projection to the axis of interest). 
Note that the values of Y tsZE and peculiar velocity are 
anticorrelated, leading to a wide possible range for both 
when fitting only two bands of SZE data. Marginaliz- 
ing over Y tsZE for each region, the median and 1-a er- 
rors on velocity are reported in Table |3l The inferred 
kSZE+tSZE spectra are plotted in Figure [TDJ We find 
that the probability of v z < is 2.1% for the fits to the 
model-derived flux density for subcluster B, and 3.4% 
for the non-parametric flux density estimates. For sub- 
cluster C, we find the probability of v z > is 15.7% 
for the model flux density estimates, and 19.3% for the 
nonparametric flux density esti mates. All of our peculiar 
velocity estimates agree with the lMa et al.l (|2009f) optical 
estimates to within 1-a. 

The possibility of using the CARMA/SZA 31 GHz 
data, which should be relatively insensitive to the kSZE 
contribution, to better constrain the tSZE component 
was considered. However, the synthesized beam in our 
CARMA/SZA 31 GHz observations was over 2', while we 
are fitting components at the arcminute scale, and would 
thus make the results difficult to compare directly. 

Our measurements of the SZE spectra using 140 and 
268 GHz Bolocam data are sensitive to several possi- 
ble systematic errors, which we describe here. Errors 
due to the relative flux calibration were included in our 
flux estimates. Since the regions used in the analysis are 
60" in diameter, and the pointing information in each 
dataset is accurate to better than 5", this result cannot 
be explained by pointing errors. We also consider possi- 
ble contamination from dusty, star forming "submillime- 
ter" galaxies (SMGs) which have been included in our 
noise model in a statistical sense, though bright and/or 
lensed SMGs could bias our measurements, particularly 
at higher frequencies. The direction of the submillimeter 
contamination is key: deviation from a purely thermal 



SZE spectrum from B requires a negative flux density at 
both 140 and 268 GHz, so (positive) contamination from 
SMGs would cause us to underestimate the kSZE signal 
rather than overestimate it. The inferred large, negative 
proper velocity of subcluster C could potentially be af- 
fected by a SMG, but is nevertheless consistent with the 
optical velocity to better than 1-a. Importantly, submm 
Herschel maps show no indication of contamination at 
a significant level in either region. The extended radio 
emission near C - too faint at 90 GHz to be seen in the 
much more sensitive MUSTANG observations - cannot 
explain the kSZE component of subcluster C, nor can 
compact radio sources, constrained by the MUSTANG 
observation to be < 90 /zJy beam -1 at 90 GHz. Finally, 
temperature substructure due to clumping, merger activ- 
ity, or the remnant core in B could increase the variance 
in our estimates, but since there is no way to constrain 
the size of this clumping effect with these data we defer 
to future work. 

6. CONCLUSIONS 

High resolution, multi-wavelength observations of the 
SZE are now beginning to offer measurements that are 
truly complementary to X-ray and optical studies of the 
complicated dynamics in galaxy clusters. Here we have 
presented sensitive, sub-arcminute measurements from 
MUSTANG at 90 GHz and from Bolocam at 140 and 
268 GHz. We compared these with lower resolution SZE 
observations obtained with CARMA/SZA at 31 GHz. 
We also compared our SZE observations to the detailed 
lcnsing, optical dynamics, radio, and our own results us- 
ing Chandra X-ray data to build a 2-dimensional tem- 
plate for modelling the thermal SZE in this cluster. 

The primary feature in MUSTANG'S high-pass- 
filtered, high-resolution view of the cluster seems to be 
associated with the merger activity between two sub- 
cluster components (C & D in the Figure [l}. This fea- 
ture is also strong in Bolocam's 268 GHz, 31"-resolution 
map of MACS J0717.5+3745, and is associated with 
the hottest gas, which approaches ~ 30 keV in spec- 
tral fits to the Chandra X-ray data from that region of 
the sky. The feature is bracketed by non-thermal, ex- 
tended emission from the relativistic gas seen in GMRT 
610 MHz and VLA 1.4-5 GHz observations, providing 
further supporting evidence for a merger scenario and a 
multi-phase intra-cluster medium. The MUSTANG ob- 
servation also reveals significant features at subcluster 
B which could include contributions from high temper- 
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Figure 9. SZE constraints on peculiar velocity and YtszE from the Bolocam 140 and 268 GHz data. Each sub-cluster was assumed to 
be isothermal, and X-ray spectroscopic contraints were used to marginalize over temperature. Yellow regions contain Ax 2 < 2.3 (68.3% 
confidence interval on both parameters), and green regions contain Ax 2 < 6.17 (95.4% confidence interval on both parameters). Black 
dashed lines enclose Ax 2 < 1 and Ax 2 < 4, the 1 and 1-a confidence intervals on the parameters taken individually (i.e. marginalizing over 
the other parameter). An 'X' marks the minimum in x 2 - The horizontal red dashed lines mark l-cr constraints from the Ma ct al. (2009) 
optical spectroscopic velocity measurements. Upper Left: Constraints on peculiar velocity and YtszE using the fluxes measured from the 
model fits within a 1' diameter region centered on subcluster B. Upper Right: Same as upper left, but using the nonparametric flux 
measurements. Lower Left: Same as upper left, but for subcluster C. Lower Right: Same as lower left, but using the nonparametric 
flux measurements. 



ature or density substructures. MUSTANG'S measure- 
ments account for ~ 2% of the integrated pressure on 
large scales, as measured in the Bolocam 140 GHz and 
CARMA/SZA 31 GHz data. 

Using the X-ray data to constrain the integrated line of 
sight pressure, and normalizing it to the large scale SZE 
observations, we constructed a 2-dimensional template 
for the thermal SZE in this cluster. The use of the bulk 
SZE measurements for normalization allows us to convert 
the X-ray pseudo-pressure map into units of Compton-y, 
and yields a value for the effective depth of the ICM that 



is consistent with the cluster's scale (~ 1 Mpc). While as- 
sumptions about the line of sight structure must be made 
and the model is imperfect, a simple spherical model for 
this unvirialized structure is clearly insufficient. The de- 
tailed comparison of these tSZE templates with the SZE 
data allows us to infer the presence of pressure substruc- 
ture not constrained by X-ray observations alone. Fur- 
ther, we presented for the first time subtraction of com- 
pact radio source contamination from the MUSTANG 
time-ordered data used in making our maps. 
By subtracting the radio source contamination from 
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Figure 10. Upper Left: Spectral sum of tSZE and kSZE flux densities for subcluster B, using the measurements obtained from the 
model fits to the Bolocam 140 & 268 GHz data (black points with error bars). The best fit combined SZE spectrum is plotted as a solid 
blue line, with 1-a errors displayed as the cyan region. SZE spectral fits were obtained through a joint likelihood analysis for the Bolocam 
data including Chandra k B T c likelihood constraints. The kSZE contribution for the best fit velocity in Table [3] is plotted in magenta (solid 
line). A pure tSZE spectrum to the data is plotted in red (solid line, with dashed lines indicating the 68.3% confidence interval). Spectra 
with v z < have a probability of 2.1% given these data. Upper Right: Same as upper left, but using the flux density measured directly 
(nonparametrically) at 140 and 268 GHz from the deconvolved data. Spectra with v z < have a probability of 3.4% given these data. 
Lower Left: Same as upper right, but for subcluster C. Spectra with v z > have a probability of 15.7% given these data. Lower Right: 
Same as upper right, but for subcluster C. Spectra with v z > have a probability of 19.3% given these data. 



the MUSTANG data, and the thermal SZE template 
from both, our observations revealed residuals indicating 
that the X-ray-derived tSZE template provides a poor 
fit that can only qualitatively describe the data. For the 
MUSTANG data, the residuals indicate significant pres- 
sure or temperature substructure not seen in the X-ray 
(e.g. out of band hot gas), but could also be due to a 
number of systematics (e.g. clumping, assumption of a 
constant line of sight depth, temperature substructure, 
or filtering and data processing effects). 

For the Bolocam data, the residual component at sub- 
cluster B after subtraction of the template is negative in 
intensity both below and above the null (~ 220 GHz) 
in the thermal SZE spectrum. The residuals are there- 
fore inconsistent with a purely thermal SZE component 
- either as an excess or deficit - or any possible com- 
pact radio or submillimeter source. Further, considering 



the high velocity (« 3200 km s" 1 ) of subcluster B, we 
find this residual to be consistent with the kinetic SZE. 
We note that this measurement is on resolved, subcluster 
scales, rather than being due to the proper motion of the 
cluster as a whole. 

Using flux densities extracted from our model fits, and 
marginalizing over the X-ray spectroscopic temperature 
constraints for the region, we found that the high- velocity 
subcluster B has a best-fit line of sight proper velocity of 
3600^16° km s" 1 . This agrees with the optical velocity 
estimate for the galaxies associated with the subcluster. 
While our results depend on assumptions about the line 
of sight temperature structure and the accuracy of the 
X-ray temperature determination, we find that the prob- 
ability v z < given our data is 2.1%. We also fit the SZE 
spectrum of the most massive subcluster, C. For this, we 
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found -3720^2480' with a 15 - 7% probability a v z > 
SZE spectrum can describe the region given our data 
and assumptions. 

We also compared our peculiar velocity estimates with 
spectral fits using the fluxes measured nonparametrically 
from the same regions of the maps, and find the proba- 
bility a v z < SZE spectrum can describe our data for 
subcluster B is 3.4%. For region C, the probability is 
19.3% that v z > given our data. 

This tantalizing result is among the highest signifi- 
cance indications of non-zero kine tic SZE from an indi- 
vidual galaxy cluster vet Csee e.g. [Holzapfel et al.l [19971: 
Benson et all 120031: iMauskopf et al.l l2Q12t iZemcov et al.l 
20121) . Even more exciting is that the kSZE spectral com- 
ponent required is on subcluster scales. Clearly, this clus- 
ter will be an interesting target for both detailed mod- 
elling of the dynamics and for future SZE studies. 
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APPENDIX 

A simple justification for the "slab approximation," which assumes the temperature is constant along the line of 
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sight, is as follows. The expression for X-ray surface brightness is 

^ = 47r(1 1 +z)3 /A e m^)^, (i) 

where A ee (T ,Z) contains the extra (1 + factor required by cosmological dimming. In the absense of any true 
line of sight information about the cluster, we defer to the standard spherical approximation. Assuming a standard (3 
profile for the density, 

n c (r)=n c0 [l + (r/r c ) 2 ]- 3P/ \ (2) 
then surface brightness as a function of sky angle 9 is (e.g. iSara zin 1988]; iReese et"aLll2002l : ILaRoque et al.l[2006h 

s^e) = s^[i + {6/e c ff- m/2 . (3) 

For pressure - allowing for the slope f3p to be different from the density profile's slope (3 (i.e. non-isothermal ICM) - 
we have 

P c (r)=P cO [l + (r/r c f]- 30p/2 . (4) 

Using y = a T /(m B c 2 ) J P e dt, Compton-y as a function of sky angle 9 is (see e.g. IReese et~aLll2002t ILaRoque et al.l 
120061) 

y(9) = y a [l + (9/6 c ff- 3M/2 . (5) 

Taking (3 — 0.7 (e.g. ILaRoque et al.l 120061 ) as the average density slope and (3p — 0.86 for the average pressure slope 
(see e.g. iPlagge et aLll2010t iMarriage et~aTll2011al; IReese et al.ll2012D . the ratio of y(9) to y/S x (0) is now proportional 
to [1 + (9/9 c ) 2 ] raised to a power of (1 - 3/3 P )/2 - (1 - 6/3)/4 = 3(/3 - (3 P )/2 + 1/4 = 0.01 « 0. The assumption that 
the slope is (i.e. the slab approximation is valid) is within the error bars reported on the average slopes (3 and (3p, 
and prevents the resulting pseudo Compton-y map from becoming unbounded at the map edges. 

We emphasize that this result does not imply pressure is constant along the line of sight, but rather that the average 
ratio of Compton-y to \/~S^ is approximately constant. The underlying pressure and density profiles are assumed, on 
average, to be described by their respective /3-model parameterizations (/3 = 0.7 and [3p — 0.86). If we were to assume 
(3 = 2/3 for the density profile instead, this would yield a /3-model taper to the power of -0.14. Both of these choices 
of f3, along with (3p = 0.86 , are consistent wi t h a polytropic inde x of 1.2-1.3, which are in turn consistent with values 
reported in recent studies (jBautz et al.ll2009t iCapelo et al.ll2012f) . 
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